Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 2002 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 999 |
| Duplicate rows (%) | 49.9% |
| Total size in memory | 359.9 KiB |
| Average record size in memory | 184.1 B |
Variable types
| Numeric | 21 |
|---|---|
| Categorical | 2 |
| Dataset has 999 (49.9%) duplicate rows | Duplicates |
PAY_0 is highly overall correlated with PAY_2 and 4 other fields | High correlation |
PAY_2 is highly overall correlated with PAY_0 and 4 other fields | High correlation |
PAY_3 is highly overall correlated with PAY_0 and 4 other fields | High correlation |
PAY_4 is highly overall correlated with PAY_0 and 4 other fields | High correlation |
PAY_5 is highly overall correlated with PAY_0 and 4 other fields | High correlation |
PAY_6 is highly overall correlated with PAY_0 and 4 other fields | High correlation |
BILL_AMT1 is highly overall correlated with LIMIT_BAL and 5 other fields | High correlation |
BILL_AMT2 is highly overall correlated with LIMIT_BAL and 5 other fields | High correlation |
BILL_AMT3 is highly overall correlated with LIMIT_BAL and 5 other fields | High correlation |
BILL_AMT4 is highly overall correlated with LIMIT_BAL and 6 other fields | High correlation |
BILL_AMT5 is highly overall correlated with BILL_AMT1 and 4 other fields | High correlation |
BILL_AMT6 is highly overall correlated with BILL_AMT1 and 4 other fields | High correlation |
PAY_AMT1 is highly overall correlated with PAY_AMT6 | High correlation |
PAY_AMT2 is highly overall correlated with PAY_AMT5 and 1 other fields | High correlation |
PAY_AMT3 is highly overall correlated with BILL_AMT4 | High correlation |
PAY_AMT4 is highly overall correlated with BILL_AMT4 | High correlation |
PAY_AMT5 is highly overall correlated with PAY_AMT2 | High correlation |
PAY_AMT6 is highly overall correlated with PAY_AMT1 and 1 other fields | High correlation |
LIMIT_BAL is highly overall correlated with BILL_AMT1 and 3 other fields | High correlation |
PAY_0 has 949 (47.4%) zeros | Zeros |
PAY_2 has 1053 (52.6%) zeros | Zeros |
PAY_3 has 1028 (51.3%) zeros | Zeros |
PAY_4 has 1080 (53.9%) zeros | Zeros |
PAY_5 has 1088 (54.3%) zeros | Zeros |
PAY_6 has 1001 (50.0%) zeros | Zeros |
BILL_AMT1 has 148 (7.4%) zeros | Zeros |
BILL_AMT2 has 198 (9.9%) zeros | Zeros |
BILL_AMT3 has 224 (11.2%) zeros | Zeros |
BILL_AMT4 has 261 (13.0%) zeros | Zeros |
BILL_AMT5 has 273 (13.6%) zeros | Zeros |
BILL_AMT6 has 307 (15.3%) zeros | Zeros |
PAY_AMT1 has 365 (18.2%) zeros | Zeros |
PAY_AMT2 has 408 (20.4%) zeros | Zeros |
PAY_AMT3 has 449 (22.4%) zeros | Zeros |
PAY_AMT4 has 463 (23.1%) zeros | Zeros |
PAY_AMT5 has 472 (23.6%) zeros | Zeros |
PAY_AMT6 has 546 (27.3%) zeros | Zeros |
Reproduction
| Analysis started | 2023-02-23 14:45:15.489168 |
|---|---|
| Analysis finished | 2023-02-23 14:46:03.804379 |
| Duration | 48.32 seconds |
| Software version | pandas-profiling vv3.5.0 |
| Download configuration | config.json |
LIMIT_BAL
Real number (ℝ)
| Distinct | 56 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 167087.91 |
| Minimum | 10000 |
|---|---|
| Maximum | 700000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 10000 |
|---|---|
| 5-th percentile | 20000 |
| Q1 | 50000 |
| median | 140000 |
| Q3 | 240000 |
| 95-th percentile | 420000 |
| Maximum | 700000 |
| Range | 690000 |
| Interquartile range (IQR) | 190000 |
Descriptive statistics
| Standard deviation | 130519.9 |
|---|---|
| Coefficient of variation (CV) | 0.78114506 |
| Kurtosis | 0.55443325 |
| Mean | 167087.91 |
| Median Absolute Deviation (MAD) | 90000 |
| Skewness | 1.0165149 |
| Sum | 3.3451 × 108 |
| Variance | 1.7035444 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50000 | 260 | 13.0% |
| 20000 | 117 | 5.8% |
| 30000 | 114 | 5.7% |
| 200000 | 103 | 5.1% |
| 80000 | 88 | 4.4% |
| 180000 | 72 | 3.6% |
| 360000 | 68 | 3.4% |
| 100000 | 66 | 3.3% |
| 140000 | 64 | 3.2% |
| 60000 | 58 | 2.9% |
| Other values (46) | 992 |
| Value | Count | Frequency (%) |
| 10000 | 26 | 1.3% |
| 20000 | 117 | |
| 30000 | 114 | |
| 40000 | 20 | 1.0% |
| 50000 | 260 | |
| 60000 | 58 | 2.9% |
| 70000 | 46 | 2.3% |
| 80000 | 88 | 4.4% |
| 90000 | 51 | 2.5% |
| 100000 | 66 | 3.3% |
| Value | Count | Frequency (%) |
| 700000 | 2 | 0.1% |
| 630000 | 4 | 0.2% |
| 620000 | 2 | 0.1% |
| 610000 | 2 | 0.1% |
| 600000 | 2 | 0.1% |
| 580000 | 2 | 0.1% |
| 510000 | 4 | 0.2% |
| 500000 | 44 | |
| 490000 | 4 | 0.2% |
| 480000 | 4 | 0.2% |
SEX
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.8 KiB |
| 2 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2002 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 1182 | |
| 1 | 820 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 1182 | |
| 1 | 820 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1182 | |
| 1 | 820 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2002 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1182 | |
| 1 | 820 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2002 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 1182 | |
| 1 | 820 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2002 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 1182 | |
| 1 | 820 |
EDUCATION
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7762238 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.74939531 |
|---|---|
| Coefficient of variation (CV) | 0.42190366 |
| Kurtosis | 1.7330872 |
| Mean | 1.7762238 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.87584311 |
| Sum | 3556 |
| Variance | 0.56159333 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 898 | |
| 1 | 790 | |
| 3 | 300 | 15.0% |
| 5 | 6 | 0.3% |
| 4 | 4 | 0.2% |
| 6 | 4 | 0.2% |
| Value | Count | Frequency (%) |
| 1 | 790 | |
| 2 | 898 | |
| 3 | 300 | 15.0% |
| 4 | 4 | 0.2% |
| 5 | 6 | 0.3% |
| 6 | 4 | 0.2% |
| Value | Count | Frequency (%) |
| 6 | 4 | 0.2% |
| 5 | 6 | 0.3% |
| 4 | 4 | 0.2% |
| 3 | 300 | 15.0% |
| 2 | 898 | |
| 1 | 790 |
MARRIAGE
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.8 KiB |
| 2 | |
|---|---|
| 1 | |
| 3 | 38 |
| 0 | 6 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2002 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 1139 | |
| 1 | 819 | |
| 3 | 38 | 1.9% |
| 0 | 6 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 1139 | |
| 1 | 819 | |
| 3 | 38 | 1.9% |
| 0 | 6 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1139 | |
| 1 | 819 | |
| 3 | 38 | 1.9% |
| 0 | 6 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2002 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1139 | |
| 1 | 819 | |
| 3 | 38 | 1.9% |
| 0 | 6 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2002 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 1139 | |
| 1 | 819 | |
| 3 | 38 | 1.9% |
| 0 | 6 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2002 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 1139 | |
| 1 | 819 | |
| 3 | 38 | 1.9% |
| 0 | 6 | 0.3% |
AGE
Real number (ℝ)
| Distinct | 44 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.941059 |
| Minimum | 21 |
|---|---|
| Maximum | 75 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 21 |
|---|---|
| 5-th percentile | 23 |
| Q1 | 28 |
| median | 33 |
| Q3 | 41 |
| 95-th percentile | 53 |
| Maximum | 75 |
| Range | 54 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 9.2197625 |
|---|---|
| Coefficient of variation (CV) | 0.26386614 |
| Kurtosis | 0.23296966 |
| Mean | 34.941059 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.81664579 |
| Sum | 69952 |
| Variance | 85.00402 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 29 | 114 | 5.7% |
| 27 | 114 | 5.7% |
| 34 | 96 | 4.8% |
| 28 | 95 | 4.7% |
| 30 | 94 | 4.7% |
| 32 | 92 | 4.6% |
| 24 | 87 | 4.3% |
| 26 | 83 | 4.1% |
| 31 | 78 | 3.9% |
| 25 | 74 | 3.7% |
| Other values (34) | 1075 |
| Value | Count | Frequency (%) |
| 21 | 2 | 0.1% |
| 22 | 54 | |
| 23 | 70 | |
| 24 | 87 | |
| 25 | 74 | |
| 26 | 83 | |
| 27 | 114 | |
| 28 | 95 | |
| 29 | 114 | |
| 30 | 94 |
| Value | Count | Frequency (%) |
| 75 | 2 | 0.1% |
| 73 | 2 | 0.1% |
| 63 | 2 | 0.1% |
| 61 | 2 | 0.1% |
| 60 | 6 | 0.3% |
| 59 | 6 | 0.3% |
| 58 | 12 | |
| 57 | 12 | |
| 56 | 20 | |
| 55 | 14 |
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.002997003 |
| Minimum | -2 |
|---|---|
| Maximum | 8 |
| Zeros | 949 |
| Zeros (%) | 47.4% |
| Negative | 586 |
| Negative (%) | 29.3% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.1718807 |
|---|---|
| Coefficient of variation (CV) | -391.01752 |
| Kurtosis | 8.0908004 |
| Mean | -0.002997003 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.5146319 |
| Sum | -6 |
| Variance | 1.3733044 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 949 | |
| -1 | 428 | |
| 1 | 272 | 13.6% |
| 2 | 167 | 8.3% |
| -2 | 158 | 7.9% |
| 3 | 12 | 0.6% |
| 4 | 8 | 0.4% |
| 8 | 8 | 0.4% |
| Value | Count | Frequency (%) |
| -2 | 158 | 7.9% |
| -1 | 428 | |
| 0 | 949 | |
| 1 | 272 | 13.6% |
| 2 | 167 | 8.3% |
| 3 | 12 | 0.6% |
| 4 | 8 | 0.4% |
| 8 | 8 | 0.4% |
| Value | Count | Frequency (%) |
| 8 | 8 | 0.4% |
| 4 | 8 | 0.4% |
| 3 | 12 | 0.6% |
| 2 | 167 | 8.3% |
| 1 | 272 | 13.6% |
| 0 | 949 | |
| -1 | 428 | |
| -2 | 158 | 7.9% |
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.15534466 |
| Minimum | -2 |
|---|---|
| Maximum | 7 |
| Zeros | 1053 |
| Zeros (%) | 52.6% |
| Negative | 671 |
| Negative (%) | 33.5% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.2274325 |
|---|---|
| Coefficient of variation (CV) | -7.9013502 |
| Kurtosis | 4.3142633 |
| Mean | -0.15534466 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.2068973 |
| Sum | -311 |
| Variance | 1.5065906 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1053 | |
| -1 | 411 | 20.5% |
| -2 | 260 | 13.0% |
| 2 | 248 | 12.4% |
| 3 | 16 | 0.8% |
| 7 | 8 | 0.4% |
| 5 | 2 | 0.1% |
| 4 | 2 | 0.1% |
| 1 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| -2 | 260 | 13.0% |
| -1 | 411 | 20.5% |
| 0 | 1053 | |
| 1 | 2 | 0.1% |
| 2 | 248 | 12.4% |
| 3 | 16 | 0.8% |
| 4 | 2 | 0.1% |
| 5 | 2 | 0.1% |
| 7 | 8 | 0.4% |
| Value | Count | Frequency (%) |
| 7 | 8 | 0.4% |
| 5 | 2 | 0.1% |
| 4 | 2 | 0.1% |
| 3 | 16 | 0.8% |
| 2 | 248 | 12.4% |
| 1 | 2 | 0.1% |
| 0 | 1053 | |
| -1 | 411 | 20.5% |
| -2 | 260 | 13.0% |
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.16083916 |
| Minimum | -2 |
|---|---|
| Maximum | 7 |
| Zeros | 1028 |
| Zeros (%) | 51.3% |
| Negative | 690 |
| Negative (%) | 34.5% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.2594887 |
|---|---|
| Coefficient of variation (CV) | -7.830734 |
| Kurtosis | 3.9731707 |
| Mean | -0.16083916 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.2303656 |
| Sum | -322 |
| Variance | 1.5863117 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1028 | |
| -1 | 416 | |
| -2 | 274 | 13.7% |
| 2 | 258 | 12.9% |
| 4 | 8 | 0.4% |
| 6 | 8 | 0.4% |
| 7 | 4 | 0.2% |
| 3 | 2 | 0.1% |
| 1 | 2 | 0.1% |
| 5 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| -2 | 274 | 13.7% |
| -1 | 416 | |
| 0 | 1028 | |
| 1 | 2 | 0.1% |
| 2 | 258 | 12.9% |
| 3 | 2 | 0.1% |
| 4 | 8 | 0.4% |
| 5 | 2 | 0.1% |
| 6 | 8 | 0.4% |
| 7 | 4 | 0.2% |
| Value | Count | Frequency (%) |
| 7 | 4 | 0.2% |
| 6 | 8 | 0.4% |
| 5 | 2 | 0.1% |
| 4 | 8 | 0.4% |
| 3 | 2 | 0.1% |
| 2 | 258 | 12.9% |
| 1 | 2 | 0.1% |
| 0 | 1028 | |
| -1 | 416 | |
| -2 | 274 | 13.7% |
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.27972028 |
| Minimum | -2 |
|---|---|
| Maximum | 7 |
| Zeros | 1080 |
| Zeros (%) | 53.9% |
| Negative | 720 |
| Negative (%) | 36.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.181939 |
|---|---|
| Coefficient of variation (CV) | -4.225432 |
| Kurtosis | 4.4573116 |
| Mean | -0.27972028 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.219633 |
| Sum | -560 |
| Variance | 1.3969798 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1080 | |
| -1 | 408 | 20.4% |
| -2 | 312 | 15.6% |
| 2 | 174 | 8.7% |
| 3 | 10 | 0.5% |
| 5 | 10 | 0.5% |
| 4 | 4 | 0.2% |
| 7 | 4 | 0.2% |
| Value | Count | Frequency (%) |
| -2 | 312 | 15.6% |
| -1 | 408 | 20.4% |
| 0 | 1080 | |
| 2 | 174 | 8.7% |
| 3 | 10 | 0.5% |
| 4 | 4 | 0.2% |
| 5 | 10 | 0.5% |
| 7 | 4 | 0.2% |
| Value | Count | Frequency (%) |
| 7 | 4 | 0.2% |
| 5 | 10 | 0.5% |
| 4 | 4 | 0.2% |
| 3 | 10 | 0.5% |
| 2 | 174 | 8.7% |
| 0 | 1080 | |
| -1 | 408 | 20.4% |
| -2 | 312 | 15.6% |
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.28021978 |
| Minimum | -2 |
|---|---|
| Maximum | 7 |
| Zeros | 1088 |
| Zeros (%) | 54.3% |
| Negative | 708 |
| Negative (%) | 35.4% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.1679967 |
|---|---|
| Coefficient of variation (CV) | -4.168145 |
| Kurtosis | 3.75307 |
| Mean | -0.28021978 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.0535828 |
| Sum | -561 |
| Variance | 1.3642162 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1088 | |
| -1 | 387 | 19.3% |
| -2 | 321 | 16.0% |
| 2 | 180 | 9.0% |
| 3 | 10 | 0.5% |
| 4 | 10 | 0.5% |
| 7 | 4 | 0.2% |
| 5 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| -2 | 321 | 16.0% |
| -1 | 387 | 19.3% |
| 0 | 1088 | |
| 2 | 180 | 9.0% |
| 3 | 10 | 0.5% |
| 4 | 10 | 0.5% |
| 5 | 2 | 0.1% |
| 7 | 4 | 0.2% |
| Value | Count | Frequency (%) |
| 7 | 4 | 0.2% |
| 5 | 2 | 0.1% |
| 4 | 10 | 0.5% |
| 3 | 10 | 0.5% |
| 2 | 180 | 9.0% |
| 0 | 1088 | |
| -1 | 387 | 19.3% |
| -2 | 321 | 16.0% |
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.31018981 |
| Minimum | -2 |
|---|---|
| Maximum | 7 |
| Zeros | 1001 |
| Zeros (%) | 50.0% |
| Negative | 778 |
| Negative (%) | 38.9% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.2022742 |
|---|---|
| Coefficient of variation (CV) | -3.8759308 |
| Kurtosis | 3.3173686 |
| Mean | -0.31018981 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 1.0622789 |
| Sum | -621 |
| Variance | 1.4454633 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1001 | |
| -1 | 435 | |
| -2 | 343 | 17.1% |
| 2 | 197 | 9.8% |
| 3 | 16 | 0.8% |
| 6 | 6 | 0.3% |
| 4 | 2 | 0.1% |
| 7 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| -2 | 343 | 17.1% |
| -1 | 435 | |
| 0 | 1001 | |
| 2 | 197 | 9.8% |
| 3 | 16 | 0.8% |
| 4 | 2 | 0.1% |
| 6 | 6 | 0.3% |
| 7 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 7 | 2 | 0.1% |
| 6 | 6 | 0.3% |
| 4 | 2 | 0.1% |
| 3 | 16 | 0.8% |
| 2 | 197 | 9.8% |
| 0 | 1001 | |
| -1 | 435 | |
| -2 | 343 | 17.1% |
| Distinct | 907 |
|---|---|
| Distinct (%) | 45.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49422.346 |
| Minimum | -14386 |
|---|---|
| Maximum | 507726 |
| Zeros | 148 |
| Zeros (%) | 7.4% |
| Negative | 44 |
| Negative (%) | 2.2% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | -14386 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3136 |
| median | 21229 |
| Q3 | 59801.75 |
| 95-th percentile | 199436 |
| Maximum | 507726 |
| Range | 522112 |
| Interquartile range (IQR) | 56665.75 |
Descriptive statistics
| Standard deviation | 72613.583 |
|---|---|
| Coefficient of variation (CV) | 1.469246 |
| Kurtosis | 8.9570767 |
| Mean | 49422.346 |
| Median Absolute Deviation (MAD) | 20831.5 |
| Skewness | 2.6708569 |
| Sum | 98943537 |
| Variance | 5.2727324 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 148 | 7.4% |
| 390 | 16 | 0.8% |
| 780 | 8 | 0.4% |
| 396 | 6 | 0.3% |
| 316 | 6 | 0.3% |
| 5780 | 4 | 0.2% |
| 2000 | 4 | 0.2% |
| 819 | 4 | 0.2% |
| -200 | 4 | 0.2% |
| 650 | 4 | 0.2% |
| Other values (897) | 1798 |
| Value | Count | Frequency (%) |
| -14386 | 2 | |
| -2000 | 2 | |
| -1312 | 2 | |
| -1100 | 2 | |
| -946 | 2 | |
| -709 | 2 | |
| -475 | 2 | |
| -288 | 2 | |
| -200 | 4 | |
| -190 | 2 |
| Value | Count | Frequency (%) |
| 507726 | 2 | |
| 507062 | 2 | |
| 471814 | 2 | |
| 467150 | 2 | |
| 422069 | 2 | |
| 400134 | 2 | |
| 386405 | 2 | |
| 367965 | 2 | |
| 366193 | 2 | |
| 355215 | 2 |
| Distinct | 878 |
|---|---|
| Distinct (%) | 43.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47891.357 |
| Minimum | -13543 |
|---|---|
| Maximum | 509229 |
| Zeros | 198 |
| Zeros (%) | 9.9% |
| Negative | 47 |
| Negative (%) | 2.3% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | -13543 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3275.5 |
| median | 20408.5 |
| Q3 | 58417.75 |
| 95-th percentile | 196143 |
| Maximum | 509229 |
| Range | 522772 |
| Interquartile range (IQR) | 55142.25 |
Descriptive statistics
| Standard deviation | 72055.49 |
|---|---|
| Coefficient of variation (CV) | 1.5045615 |
| Kurtosis | 9.6659181 |
| Mean | 47891.357 |
| Median Absolute Deviation (MAD) | 20092.5 |
| Skewness | 2.7766944 |
| Sum | 95878497 |
| Variance | 5.1919937 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 198 | 9.9% |
| 390 | 10 | 0.5% |
| 780 | 8 | 0.4% |
| 316 | 8 | 0.4% |
| 300 | 8 | 0.4% |
| 396 | 6 | 0.3% |
| -200 | 6 | 0.3% |
| 1261 | 6 | 0.3% |
| 291 | 6 | 0.3% |
| 100 | 4 | 0.2% |
| Other values (868) | 1742 |
| Value | Count | Frequency (%) |
| -13543 | 2 | |
| -9850 | 2 | |
| -1100 | 2 | |
| -1041 | 2 | |
| -946 | 2 | |
| -818 | 1 | |
| -709 | 2 | |
| -707 | 2 | |
| -425 | 2 | |
| -303 | 2 |
| Value | Count | Frequency (%) |
| 509229 | 2 | |
| 491956 | 2 | |
| 478380 | 2 | |
| 458862 | 2 | |
| 431342 | 2 | |
| 412023 | 2 | |
| 398857 | 2 | |
| 387910 | 2 | |
| 372700 | 2 | |
| 363325 | 2 |
| Distinct | 866 |
|---|---|
| Distinct (%) | 43.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44981.964 |
| Minimum | -9850 |
|---|---|
| Maximum | 499936 |
| Zeros | 224 |
| Zeros (%) | 11.2% |
| Negative | 44 |
| Negative (%) | 2.2% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | -9850 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1947 |
| median | 19298 |
| Q3 | 54477 |
| 95-th percentile | 186292 |
| Maximum | 499936 |
| Range | 509786 |
| Interquartile range (IQR) | 52530 |
Descriptive statistics
| Standard deviation | 69510.626 |
|---|---|
| Coefficient of variation (CV) | 1.5452999 |
| Kurtosis | 10.605545 |
| Mean | 44981.964 |
| Median Absolute Deviation (MAD) | 18908 |
| Skewness | 2.8994823 |
| Sum | 90053892 |
| Variance | 4.8317271 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 224 | 11.2% |
| 390 | 16 | 0.8% |
| 396 | 6 | 0.3% |
| 316 | 6 | 0.3% |
| -2 | 6 | 0.3% |
| 780 | 6 | 0.3% |
| 664 | 4 | 0.2% |
| 325 | 4 | 0.2% |
| 1350 | 4 | 0.2% |
| 13001 | 4 | 0.2% |
| Other values (856) | 1722 |
| Value | Count | Frequency (%) |
| -9850 | 2 | |
| -2697 | 2 | |
| -1690 | 2 | |
| -946 | 2 | |
| -709 | 2 | |
| -684 | 2 | |
| -527 | 2 | |
| -387 | 2 | |
| -288 | 2 | |
| -281 | 2 |
| Value | Count | Frequency (%) |
| 499936 | 2 | |
| 479432 | 2 | |
| 469703 | 2 | |
| 445007 | 2 | |
| 430637 | 2 | |
| 404205 | 2 | |
| 395612 | 2 | |
| 375948 | 2 | |
| 375070 | 2 | |
| 373181 | 2 |
| Distinct | 848 |
|---|---|
| Distinct (%) | 42.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40741.321 |
| Minimum | -3684 |
|---|---|
| Maximum | 628699 |
| Zeros | 261 |
| Zeros (%) | 13.0% |
| Negative | 44 |
| Negative (%) | 2.2% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | -3684 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1438 |
| median | 17743 |
| Q3 | 48800 |
| 95-th percentile | 167163 |
| Maximum | 628699 |
| Range | 632383 |
| Interquartile range (IQR) | 47362 |
Descriptive statistics
| Standard deviation | 68166.982 |
|---|---|
| Coefficient of variation (CV) | 1.6731657 |
| Kurtosis | 17.846334 |
| Mean | 40741.321 |
| Median Absolute Deviation (MAD) | 17328 |
| Skewness | 3.5793917 |
| Sum | 81564124 |
| Variance | 4.6467374 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 261 | 13.0% |
| 390 | 14 | 0.7% |
| 316 | 10 | 0.5% |
| 300 | 6 | 0.3% |
| 2340 | 4 | 0.2% |
| 362 | 4 | 0.2% |
| 5400 | 4 | 0.2% |
| 240 | 4 | 0.2% |
| 5818 | 4 | 0.2% |
| -2 | 4 | 0.2% |
| Other values (838) | 1687 |
| Value | Count | Frequency (%) |
| -3684 | 2 | |
| -2898 | 2 | |
| -2618 | 2 | |
| -946 | 2 | |
| -923 | 2 | |
| -828 | 2 | |
| -810 | 2 | |
| -387 | 2 | |
| -288 | 2 | |
| -281 | 2 |
| Value | Count | Frequency (%) |
| 628699 | 2 | |
| 542653 | 2 | |
| 505507 | 2 | |
| 487066 | 2 | |
| 479978 | 2 | |
| 447130 | 2 | |
| 386295 | 2 | |
| 376657 | 2 | |
| 360199 | 2 | |
| 354839 | 2 |
| Distinct | 836 |
|---|---|
| Distinct (%) | 41.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39071.871 |
| Minimum | -28335 |
|---|---|
| Maximum | 484612 |
| Zeros | 273 |
| Zeros (%) | 13.6% |
| Negative | 52 |
| Negative (%) | 2.6% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | -28335 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1254 |
| median | 17591.5 |
| Q3 | 46361.75 |
| 95-th percentile | 165725 |
| Maximum | 484612 |
| Range | 512947 |
| Interquartile range (IQR) | 45107.75 |
Descriptive statistics
| Standard deviation | 63062.665 |
|---|---|
| Coefficient of variation (CV) | 1.614017 |
| Kurtosis | 12.842305 |
| Mean | 39071.871 |
| Median Absolute Deviation (MAD) | 17175.5 |
| Skewness | 3.1093559 |
| Sum | 78221886 |
| Variance | 3.9768997 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 273 | 13.6% |
| 390 | 16 | 0.8% |
| 2000 | 6 | 0.3% |
| 316 | 6 | 0.3% |
| 396 | 6 | 0.3% |
| 150 | 6 | 0.3% |
| 688 | 4 | 0.2% |
| 19450 | 4 | 0.2% |
| 166 | 4 | 0.2% |
| 19323 | 4 | 0.2% |
| Other values (826) | 1673 |
| Value | Count | Frequency (%) |
| -28335 | 2 | |
| -5000 | 2 | |
| -3272 | 2 | |
| -1488 | 2 | |
| -1005 | 2 | |
| -946 | 2 | |
| -783 | 2 | |
| -679 | 2 | |
| -527 | 2 | |
| -420 | 2 |
| Value | Count | Frequency (%) |
| 484612 | 2 | |
| 483003 | 2 | |
| 471145 | 2 | |
| 440982 | 2 | |
| 369532 | 2 | |
| 356656 | 2 | |
| 356636 | 2 | |
| 356206 | 2 | |
| 335760 | 2 | |
| 315820 | 2 |
| Distinct | 824 |
|---|---|
| Distinct (%) | 41.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38056.488 |
| Minimum | -339603 |
|---|---|
| Maximum | 473944 |
| Zeros | 307 |
| Zeros (%) | 15.3% |
| Negative | 36 |
| Negative (%) | 1.8% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | -339603 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 869 |
| median | 15874 |
| Q3 | 46557 |
| 95-th percentile | 167964 |
| Maximum | 473944 |
| Range | 813547 |
| Interquartile range (IQR) | 45688 |
Descriptive statistics
| Standard deviation | 63040.633 |
|---|---|
| Coefficient of variation (CV) | 1.6565016 |
| Kurtosis | 12.144814 |
| Mean | 38056.488 |
| Median Absolute Deviation (MAD) | 15664.5 |
| Skewness | 2.635236 |
| Sum | 76189089 |
| Variance | 3.9741214 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 307 | 15.3% |
| 390 | 16 | 0.8% |
| 150 | 8 | 0.4% |
| 316 | 8 | 0.4% |
| 291 | 6 | 0.3% |
| 1320 | 6 | 0.3% |
| 780 | 5 | 0.2% |
| 830 | 4 | 0.2% |
| 199 | 4 | 0.2% |
| -200 | 4 | 0.2% |
| Other values (814) | 1634 |
| Value | Count | Frequency (%) |
| -339603 | 2 | |
| -3272 | 2 | |
| -1884 | 2 | |
| -946 | 2 | |
| -780 | 2 | |
| -304 | 2 | |
| -281 | 2 | |
| -246 | 2 | |
| -200 | 4 | |
| -189 | 2 |
| Value | Count | Frequency (%) |
| 473944 | 2 | |
| 469961 | 2 | |
| 434715 | 2 | |
| 419643 | 2 | |
| 367399 | 2 | |
| 364089 | 2 | |
| 352257 | 2 | |
| 330121 | 2 | |
| 309959 | 2 | |
| 305498 | 2 |
| Distinct | 522 |
|---|---|
| Distinct (%) | 26.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5373.7023 |
| Minimum | 0 |
|---|---|
| Maximum | 199646 |
| Zeros | 365 |
| Zeros (%) | 18.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1000 |
| median | 2160 |
| Q3 | 5085 |
| 95-th percentile | 20000 |
| Maximum | 199646 |
| Range | 199646 |
| Interquartile range (IQR) | 4085 |
Descriptive statistics
| Standard deviation | 12177.441 |
|---|---|
| Coefficient of variation (CV) | 2.2661175 |
| Kurtosis | 88.154436 |
| Mean | 5373.7023 |
| Median Absolute Deviation (MAD) | 1926 |
| Skewness | 7.7466708 |
| Sum | 10758152 |
| Variance | 1.4829006 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 365 | 18.2% |
| 2000 | 80 | 4.0% |
| 3000 | 64 | 3.2% |
| 2500 | 40 | 2.0% |
| 10000 | 38 | 1.9% |
| 5000 | 33 | 1.6% |
| 1000 | 32 | 1.6% |
| 1500 | 26 | 1.3% |
| 4000 | 22 | 1.1% |
| 1800 | 18 | 0.9% |
| Other values (512) | 1284 |
| Value | Count | Frequency (%) |
| 0 | 365 | |
| 1 | 2 | 0.1% |
| 39 | 4 | 0.2% |
| 92 | 2 | 0.1% |
| 100 | 2 | 0.1% |
| 105 | 2 | 0.1% |
| 131 | 2 | 0.1% |
| 138 | 2 | 0.1% |
| 157 | 2 | 0.1% |
| 165 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 199646 | 2 | |
| 120093 | 2 | |
| 120041 | 2 | |
| 90000 | 2 | |
| 81690 | 2 | |
| 80000 | 4 | |
| 70010 | 2 | |
| 67650 | 2 | |
| 57087 | 2 | |
| 55000 | 2 |
| Distinct | 521 |
|---|---|
| Distinct (%) | 26.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5049.9426 |
| Minimum | 0 |
|---|---|
| Maximum | 285138 |
| Zeros | 408 |
| Zeros (%) | 20.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 390 |
| median | 1700 |
| Q3 | 4500 |
| 95-th percentile | 16025 |
| Maximum | 285138 |
| Range | 285138 |
| Interquartile range (IQR) | 4110 |
Descriptive statistics
| Standard deviation | 15622.382 |
|---|---|
| Coefficient of variation (CV) | 3.0935761 |
| Kurtosis | 150.69656 |
| Mean | 5049.9426 |
| Median Absolute Deviation (MAD) | 1700 |
| Skewness | 10.744887 |
| Sum | 10109985 |
| Variance | 2.4405881 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 408 | 20.4% |
| 2000 | 57 | 2.8% |
| 1500 | 54 | 2.7% |
| 5000 | 54 | 2.7% |
| 3000 | 54 | 2.7% |
| 1000 | 49 | 2.4% |
| 1600 | 24 | 1.2% |
| 1400 | 20 | 1.0% |
| 1200 | 20 | 1.0% |
| 6000 | 18 | 0.9% |
| Other values (511) | 1244 |
| Value | Count | Frequency (%) |
| 0 | 408 | |
| 1 | 2 | 0.1% |
| 2 | 4 | 0.2% |
| 3 | 2 | 0.1% |
| 5 | 2 | 0.1% |
| 7 | 2 | 0.1% |
| 10 | 2 | 0.1% |
| 11 | 2 | 0.1% |
| 12 | 2 | 0.1% |
| 15 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 285138 | 2 | |
| 199982 | 2 | |
| 177671 | 2 | |
| 145000 | 2 | |
| 104279 | 2 | |
| 88678 | 2 | |
| 84440 | 2 | |
| 75720 | 2 | |
| 55693 | 2 | |
| 52110 | 2 |
| Distinct | 495 |
|---|---|
| Distinct (%) | 24.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4130.6553 |
| Minimum | 0 |
|---|---|
| Maximum | 133657 |
| Zeros | 449 |
| Zeros (%) | 22.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 229.25 |
| median | 1200 |
| Q3 | 3715 |
| 95-th percentile | 14324.3 |
| Maximum | 133657 |
| Range | 133657 |
| Interquartile range (IQR) | 3485.75 |
Descriptive statistics
| Standard deviation | 10340.006 |
|---|---|
| Coefficient of variation (CV) | 2.5032361 |
| Kurtosis | 61.655392 |
| Mean | 4130.6553 |
| Median Absolute Deviation (MAD) | 1200 |
| Skewness | 6.8182985 |
| Sum | 8269572 |
| Variance | 1.0691572 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 449 | 22.4% |
| 1000 | 103 | 5.1% |
| 2000 | 76 | 3.8% |
| 3000 | 70 | 3.5% |
| 5000 | 44 | 2.2% |
| 1500 | 24 | 1.2% |
| 6000 | 20 | 1.0% |
| 10000 | 20 | 1.0% |
| 500 | 18 | 0.9% |
| 4000 | 18 | 0.9% |
| Other values (485) | 1160 |
| Value | Count | Frequency (%) |
| 0 | 449 | |
| 3 | 2 | 0.1% |
| 27 | 2 | 0.1% |
| 28 | 2 | 0.1% |
| 50 | 2 | 0.1% |
| 54 | 2 | 0.1% |
| 87 | 2 | 0.1% |
| 91 | 2 | 0.1% |
| 100 | 2 | 0.1% |
| 116 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 133657 | 2 | |
| 130000 | 2 | |
| 89000 | 1 | |
| 80000 | 2 | |
| 75940 | 2 | |
| 74354 | 2 | |
| 68454 | 2 | |
| 65840 | 2 | |
| 62520 | 2 | |
| 61411 | 2 |
| Distinct | 482 |
|---|---|
| Distinct (%) | 24.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4669.7033 |
| Minimum | 0 |
|---|---|
| Maximum | 188840 |
| Zeros | 463 |
| Zeros (%) | 23.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 150 |
| median | 1380 |
| Q3 | 4000 |
| 95-th percentile | 17000 |
| Maximum | 188840 |
| Range | 188840 |
| Interquartile range (IQR) | 3850 |
Descriptive statistics
| Standard deviation | 13266.465 |
|---|---|
| Coefficient of variation (CV) | 2.8409654 |
| Kurtosis | 70.355758 |
| Mean | 4669.7033 |
| Median Absolute Deviation (MAD) | 1380 |
| Skewness | 7.4498753 |
| Sum | 9348746 |
| Variance | 1.7599911 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 463 | |
| 1000 | 89 | 4.4% |
| 2000 | 70 | 3.5% |
| 5000 | 48 | 2.4% |
| 3000 | 48 | 2.4% |
| 1500 | 36 | 1.8% |
| 4000 | 32 | 1.6% |
| 500 | 26 | 1.3% |
| 2500 | 22 | 1.1% |
| 10000 | 16 | 0.8% |
| Other values (472) | 1152 |
| Value | Count | Frequency (%) |
| 0 | 463 | |
| 6 | 6 | 0.3% |
| 7 | 2 | 0.1% |
| 17 | 2 | 0.1% |
| 25 | 2 | 0.1% |
| 64 | 2 | 0.1% |
| 69 | 2 | 0.1% |
| 74 | 2 | 0.1% |
| 92 | 2 | 0.1% |
| 98 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 188840 | 2 | |
| 146900 | 2 | |
| 107591 | 2 | |
| 100000 | 4 | |
| 99669 | 2 | |
| 99000 | 2 | |
| 97441 | 2 | |
| 88348 | 2 | |
| 80552 | 2 | |
| 79377 | 2 |
| Distinct | 481 |
|---|---|
| Distinct (%) | 24.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5332.6818 |
| Minimum | 0 |
|---|---|
| Maximum | 195599 |
| Zeros | 472 |
| Zeros (%) | 23.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 196.75 |
| median | 1306 |
| Q3 | 3745 |
| 95-th percentile | 17000 |
| Maximum | 195599 |
| Range | 195599 |
| Interquartile range (IQR) | 3548.25 |
Descriptive statistics
| Standard deviation | 16807.872 |
|---|---|
| Coefficient of variation (CV) | 3.151861 |
| Kurtosis | 58.094076 |
| Mean | 5332.6818 |
| Median Absolute Deviation (MAD) | 1306 |
| Skewness | 7.0296679 |
| Sum | 10676029 |
| Variance | 2.8250456 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 472 | |
| 1000 | 84 | 4.2% |
| 3000 | 70 | 3.5% |
| 2000 | 64 | 3.2% |
| 1500 | 48 | 2.4% |
| 5000 | 38 | 1.9% |
| 4000 | 24 | 1.2% |
| 500 | 18 | 0.9% |
| 1200 | 16 | 0.8% |
| 3500 | 16 | 0.8% |
| Other values (471) | 1152 |
| Value | Count | Frequency (%) |
| 0 | 472 | |
| 12 | 2 | 0.1% |
| 60 | 2 | 0.1% |
| 91 | 1 | < 0.1% |
| 100 | 2 | 0.1% |
| 150 | 8 | 0.4% |
| 160 | 2 | 0.1% |
| 162 | 2 | 0.1% |
| 169 | 2 | 0.1% |
| 175 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 195599 | 2 | 0.1% |
| 184922 | 2 | 0.1% |
| 162000 | 2 | 0.1% |
| 160719 | 2 | 0.1% |
| 133841 | 2 | 0.1% |
| 132200 | 2 | 0.1% |
| 130291 | 2 | 0.1% |
| 101005 | 2 | 0.1% |
| 100000 | 6 | |
| 85900 | 2 | 0.1% |
| Distinct | 436 |
|---|---|
| Distinct (%) | 21.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5096.9461 |
| Minimum | 0 |
|---|---|
| Maximum | 528666 |
| Zeros | 546 |
| Zeros (%) | 27.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1261 |
| Q3 | 3800 |
| 95-th percentile | 13770 |
| Maximum | 528666 |
| Range | 528666 |
| Interquartile range (IQR) | 3800 |
Descriptive statistics
| Standard deviation | 23652.198 |
|---|---|
| Coefficient of variation (CV) | 4.6404647 |
| Kurtosis | 289.2105 |
| Mean | 5096.9461 |
| Median Absolute Deviation (MAD) | 1261 |
| Skewness | 15.230827 |
| Sum | 10204086 |
| Variance | 5.5942649 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 546 | |
| 2000 | 101 | 5.0% |
| 1000 | 100 | 5.0% |
| 3000 | 56 | 2.8% |
| 5000 | 52 | 2.6% |
| 2500 | 30 | 1.5% |
| 1500 | 30 | 1.5% |
| 4000 | 24 | 1.2% |
| 10000 | 24 | 1.2% |
| 6000 | 18 | 0.9% |
| Other values (426) | 1021 |
| Value | Count | Frequency (%) |
| 0 | 546 | |
| 1 | 2 | 0.1% |
| 3 | 2 | 0.1% |
| 4 | 2 | 0.1% |
| 60 | 2 | 0.1% |
| 62 | 2 | 0.1% |
| 66 | 4 | 0.2% |
| 95 | 2 | 0.1% |
| 100 | 4 | 0.2% |
| 102 | 4 | 0.2% |
| Value | Count | Frequency (%) |
| 528666 | 2 | |
| 345293 | 2 | |
| 185652 | 2 | |
| 167000 | 2 | |
| 153504 | 2 | |
| 126685 | 2 | |
| 105700 | 2 | |
| 77195 | 2 | |
| 68978 | 2 | |
| 67619 | 2 |
Auto
The auto setting is an interpretable pairwise column metric of the following mapping:- Variable_type-Variable_type : Method, Range
- Categorical-Categorical : Cramer's V, [0,1]
- Numerical-Categorical : Cramer's V, [0,1] (using a discretized numerical column)
- Numerical-Numerical : Spearman's ρ, [-1,1]
This configuration uses the recommended metric for each pair of columns.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.| LIMIT_BAL | SEX | EDUCATION | MARRIAGE | AGE | PAY_0 | PAY_2 | PAY_3 | PAY_4 | PAY_5 | PAY_6 | BILL_AMT1 | BILL_AMT2 | BILL_AMT3 | BILL_AMT4 | BILL_AMT5 | BILL_AMT6 | PAY_AMT1 | PAY_AMT2 | PAY_AMT3 | PAY_AMT4 | PAY_AMT5 | PAY_AMT6 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 20000 | 2 | 2 | 1 | 24 | 2 | 2 | -1 | -1 | -2 | -2 | 3913 | 3102 | 689 | 0 | 0 | 0 | 0 | 689 | 0 | 0 | 0 | 0 |
| 1 | 120000 | 2 | 2 | 2 | 26 | -1 | 2 | 0 | 0 | 0 | 2 | 2682 | 1725 | 2682 | 3272 | 3455 | 3261 | 0 | 1000 | 1000 | 1000 | 0 | 2000 |
| 2 | 90000 | 2 | 2 | 2 | 34 | 0 | 0 | 0 | 0 | 0 | 0 | 29239 | 14027 | 13559 | 14331 | 14948 | 15549 | 1518 | 1500 | 1000 | 1000 | 1000 | 5000 |
| 3 | 50000 | 2 | 2 | 1 | 37 | 0 | 0 | 0 | 0 | 0 | 0 | 46990 | 48233 | 49291 | 28314 | 28959 | 29547 | 2000 | 2019 | 1200 | 1100 | 1069 | 1000 |
| 4 | 50000 | 1 | 2 | 1 | 57 | -1 | 0 | -1 | 0 | 0 | 0 | 8617 | 5670 | 35835 | 20940 | 19146 | 19131 | 2000 | 36681 | 10000 | 9000 | 689 | 679 |
| 5 | 50000 | 1 | 1 | 2 | 37 | 0 | 0 | 0 | 0 | 0 | 0 | 64400 | 57069 | 57608 | 19394 | 19619 | 20024 | 2500 | 1815 | 657 | 1000 | 1000 | 800 |
| 6 | 500000 | 1 | 1 | 2 | 29 | 0 | 0 | 0 | 0 | 0 | 0 | 367965 | 412023 | 445007 | 542653 | 483003 | 473944 | 55000 | 40000 | 38000 | 20239 | 13750 | 13770 |
| 7 | 100000 | 2 | 2 | 2 | 23 | 0 | -1 | -1 | 0 | 0 | -1 | 11876 | 380 | 601 | 221 | -159 | 567 | 380 | 601 | 0 | 581 | 1687 | 1542 |
| 8 | 140000 | 2 | 3 | 1 | 28 | 0 | 0 | 2 | 0 | 0 | 0 | 11285 | 14096 | 12108 | 12211 | 11793 | 3719 | 3329 | 0 | 432 | 1000 | 1000 | 1000 |
| 9 | 20000 | 1 | 3 | 2 | 35 | -2 | -2 | -2 | -2 | -1 | -1 | 0 | 0 | 0 | 0 | 13007 | 13912 | 0 | 0 | 0 | 13007 | 1122 | 0 |
| LIMIT_BAL | SEX | EDUCATION | MARRIAGE | AGE | PAY_0 | PAY_2 | PAY_3 | PAY_4 | PAY_5 | PAY_6 | BILL_AMT1 | BILL_AMT2 | BILL_AMT3 | BILL_AMT4 | BILL_AMT5 | BILL_AMT6 | PAY_AMT1 | PAY_AMT2 | PAY_AMT3 | PAY_AMT4 | PAY_AMT5 | PAY_AMT6 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1992 | 360000 | 2 | 1 | 2 | 25 | 0 | 0 | 0 | 0 | 0 | -2 | 279846 | 169426 | 68810 | 12800 | 0 | 0 | 7004 | 1793 | 2757 | 0 | 0 | 0 |
| 1993 | 290000 | 2 | 1 | 2 | 29 | 1 | -2 | -2 | -2 | -2 | -2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 1994 | 200000 | 1 | 1 | 2 | 39 | -2 | -2 | -2 | -2 | -2 | -2 | -200 | -200 | -200 | 0 | 60800 | 0 | 0 | 0 | 200 | 60800 | 0 | 0 |
| 1995 | 140000 | 1 | 1 | 1 | 45 | 0 | 0 | 0 | 0 | 2 | 2 | 39716 | 40799 | 41853 | 44452 | 45433 | 46383 | 1600 | 1600 | 3169 | 1700 | 1700 | 1495 |
| 1996 | 360000 | 1 | 1 | 1 | 38 | 1 | -2 | -2 | -2 | -2 | -2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 1997 | 50000 | 2 | 2 | 2 | 23 | -1 | -1 | -1 | 0 | -1 | -1 | 780 | 0 | 780 | 390 | 390 | 500 | 0 | 780 | 0 | 390 | 500 | 18300 |
| 1998 | 120000 | 1 | 2 | 2 | 25 | 2 | 2 | 0 | 0 | 0 | 0 | 113348 | 110119 | 111700 | 83858 | 86434 | 88802 | 0 | 5000 | 3158 | 3934 | 3802 | 2000 |
| 1999 | 100000 | 1 | 2 | 1 | 29 | 0 | 0 | 0 | 0 | -1 | -1 | 94453 | 95860 | 67782 | -2618 | 95748 | 101299 | 3320 | 5000 | 0 | 100000 | 7186 | 0 |
| 2000 | 200000 | 2 | 2 | 1 | 28 | 0 | 0 | 0 | 0 | 0 | 0 | 81865 | 86790 | 8441 | 97041 | 103541 | 3632 | 5000 | 2000 | 89000 | 6500 | 91 | 1504 |
| 2001 | 90000 | 2 | 2 | 1 | 40 | -1 | -1 | -1 | -1 | -1 | -1 | 4989 | -818 | 1114 | 657 | 1332 | 780 | 0 | 2806 | 2256 | 2274 | 780 | 0 |
Most frequently occurring
| LIMIT_BAL | SEX | EDUCATION | MARRIAGE | AGE | PAY_0 | PAY_2 | PAY_3 | PAY_4 | PAY_5 | PAY_6 | BILL_AMT1 | BILL_AMT2 | BILL_AMT3 | BILL_AMT4 | BILL_AMT5 | BILL_AMT6 | PAY_AMT1 | PAY_AMT2 | PAY_AMT3 | PAY_AMT4 | PAY_AMT5 | PAY_AMT6 | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 10000 | 1 | 2 | 1 | 45 | 0 | 0 | 0 | 2 | 0 | 0 | 7139 | 8416 | 9815 | 9508 | 9754 | 10192 | 1400 | 1700 | 0 | 400 | 600 | 200 | 2 |
| 1 | 10000 | 1 | 2 | 1 | 56 | 2 | 2 | 2 | 0 | 0 | 0 | 2097 | 4193 | 3978 | 4062 | 4196 | 4326 | 2300 | 0 | 150 | 200 | 200 | 160 | 2 |
| 2 | 10000 | 1 | 2 | 2 | 22 | 0 | 0 | 0 | 0 | 0 | 0 | 1877 | 3184 | 6003 | 3576 | 3670 | 4451 | 1500 | 2927 | 1000 | 300 | 1000 | 500 | 2 |
| 3 | 10000 | 1 | 2 | 2 | 22 | 0 | 0 | 0 | 0 | 0 | 0 | 7960 | 9649 | 8518 | 8628 | 9293 | 5033 | 2000 | 1000 | 500 | 1500 | 0 | 2500 | 2 |
| 4 | 10000 | 1 | 2 | 2 | 24 | -1 | 2 | 2 | 2 | 0 | 0 | 2887 | 1923 | 2989 | 2813 | 2008 | 2132 | 0 | 1500 | 0 | 0 | 150 | 0 | 2 |
| 5 | 10000 | 1 | 2 | 2 | 27 | 0 | 0 | 2 | 0 | 0 | 0 | 7015 | 10227 | 9560 | 9901 | 9963 | 10182 | 3507 | 0 | 500 | 370 | 393 | 700 | 2 |
| 6 | 10000 | 1 | 2 | 2 | 33 | 0 | 0 | 0 | 0 | 0 | 0 | 8177 | 9131 | 9669 | 7624 | 8049 | 6857 | 2500 | 1145 | 1000 | 1000 | 1000 | 1500 | 2 |
| 7 | 10000 | 1 | 2 | 2 | 37 | 0 | 0 | 0 | 0 | 2 | 2 | 8755 | 8158 | 7540 | 8164 | 6963 | 5923 | 1167 | 1022 | 1036 | 0 | 2700 | 0 | 2 |
| 8 | 10000 | 1 | 2 | 2 | 46 | 0 | 0 | 2 | 2 | 2 | 0 | 4073 | 6394 | 6143 | 6908 | 6652 | 6785 | 2400 | 0 | 871 | 0 | 244 | 251 | 2 |
| 9 | 10000 | 1 | 3 | 2 | 23 | 0 | 0 | 0 | 0 | 0 | 2 | 6974 | 7838 | 9002 | 9182 | 9729 | 9411 | 1134 | 1298 | 478 | 847 | 0 | 175 | 2 |